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ABSTRACT 

We examine the correlational property of Lya clouds (along the line of 
sight) in detail utilizing a hydrodynamic simulation of Lya clouds in a cold 
dark matter universe with a cosmological constant, and compare it to that of 
mass and galaxies. We show that the correlation strength of Lya clouds is 
somewhat weaker than that of the underlying matter, which in turn should be 
weaker than that of galaxies (biased galaxy formation). On the scales probed, 
10 — 300km/s, for Lya clouds we find that higher density, higher optical depth, 
higher column density regions are more strongly clustered than lower density, 
lower optical depth, lower column density regions, with the difference being 
larger at small separations and smaller at large separations. Thus, a consistent 
picture seems to emerge: the correlation strength for a given set of objects is 
positively correlated with their characteristic global density and the differences 
among the correlations of galaxies, Lya clouds and mass reflect the differences 
in density that each trace. Significant positive correlations with a strength of 
0.1 — 1.0 are found for Lya clouds in the velocity range 50 — 300km/s. This 
effect should be observable. The correlation function of Lya clouds seems to be 
a monotonically decreasing function of separation, indicating that correlation 
strength should be less than 0.1 at Av > 300km/s, where our current simulation 
box is too small to give a reliable measure. 

Among the correlational measures examined, an optical depth correlation 
function (Equation 5) proposed here may serve as the best correlational measure. 
It reasonably faithfully represents the true correlation of the underlying matter, 
enabling a better indication of both matter correlation and the relationship 
between galaxies and Lya clouds. Furthermore, it appears to be an alternative 
to the conventional line-line correlation function with the virtue that it does not 
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require ambiguous post-observation fitting procedures such as those commonly 
employed in the conventional line-finding methods. Neither does it depend 
sensitively on the observational resolution (e.g., FWHM), insofaras the clouds 
are resolved (i.e., the FWHM is smaller than the fine width). Conveniently, 
it can be easily measured with the current observational sensitivity without 
being contaminated significantly by the presence of noise, if one chooses an 
appropriate optical depth fioor value Tmin (an adjustable parameter) say, < 2.0. 

Subject headings: Cosmology: large-scale structure of Universe - cosmology: 
theory - intergalactic medium - quasars: absorption lines - hydrodynamics 
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1. Introduction 

With rapid advances in observational capabilities in recent years, thanks to the 
advent of the ground-based 10m Keck telescopes as well as the Hubble Space Telescope, 
observation of the Lya forest has become a highly useful tool for probing the universe at 
low-to-moderate redshift (e.g., Carswell et al. 1991; Ranch et al. 1992; Petitjean et al. 1993; 
Schneider et al. 1993; Cristiani et al. 1995; Hu et al. 1995; Tytler et al. 1995; Lanzetta 
et al. 1995; Bahcall ct al. 1996). When analyzed in conjunction with progressively more 
sophisticated and accurate theoretical modelling (Cen et al. 1994, CMOR hereafter; Zhang 
et al. 1995; Hernquist et al. 1996; Miralda-Escude et al. 1996, MCOR hereafter), made 
possible by the availability of powerful computers and accurate cosmological hydrodynamic 
codes, observations of Lya clouds should greatly enhance our ability to test cosmological 
models. The above quoted papers show such a good agreement between theory and 
observation with regard to physical properties which have already been documented, that 
we are led to use the theory now to predict properties, such as spatial correlations among 
Lya clouds, that have not yet been reliably measured. 

Early observations that Lya clouds arc very weakly clustered in velocity space on 
scales > 300km/s along the line of sight to distant quasars at high redshift (Sargent et al. 
1980), compared to the expected correlations among galaxies at the same redshift, provided 
the first compelling evidence that Lya clouds at high redshift are perhaps not associated 
with galaxies. This single piece of evidence, combined with the inferred temperature of the 
clouds, has spawned most subsequent discussions about their origin and nature (Ostriker & 
Ikeuchi 1983; Ikeuchi & Ostriker 1986; Rees 1986; Ikeuchi 1986; Bond, Szalay, & Silk 1988). 
On the other hand, the recent observations by HST seem to indicate that Lya clouds at low 
redshift {z < 1.0) may have their origin directly inside or in very close proximity to galaxies 
(Lanzetta et al. 1995; Bahcall et al. 1996). Of course, ground based moderate redshift 
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observations and space based low redshift observations may sample essentially different 
populations, since the high redshift population is declining with decreasing redshift at such 
a large rate as to make a negligible contribution to the HST observed lines. Moreover, 
various studies of the clustering of Lya clouds at high redshift, which primarily measure 
hne correlations along the line of sight, have arrived at different results. For example, Webb 
(1986) suggests that Lya clouds arc weakly clustered with < C, >= 0.32 ± 0.08 over the 
velocity range 50 — 290km/s, while Ranch et al. (1992) find no clustering at any scale for the 
clouds as a whole. Such seemingly conflicting observational evidence at high redshift and 
between high and low redshifts indicate that our current understanding of the clustering 
properties of Lya clouds may be quite incomplete. 

In this paper we attempt to understand better the spatial clustering properties of 
Lya clouds by analyzing the simulated Lya clouds in the cold dark matter model with a 
cosmological constant (ACDM). This model, according to the MCOR analysis, provides 
a good fit to the observed distributions of absorption in the Lya forest as measured by 
the one-point distribution function of transmitted flux, namely the flux distribution and 
the derived column density distribution, but that work did not examine the two-point 
correlation of the Lya clouds in much detail. In the present work several clustering measures 
are used, including a line-line two-point correlation function and measures (some newly 
proposed) directly using flux and optical depth distributions as well as more fundamental 
matter correlations, directly computable in simulations. The paper is organized as follows: 
A brief description of the simulation is given in §2. The correlational measures used to 
analyze the simulation are deflned in §3. Results and conclusions are given in §4 and §5. 
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2. Simulation 

We simulate the formation of Lya clouds in a spatially flat cold dark matter universe 
with a cosmological constant (ACDM), with the following cosmological parameters: Hubble 
constant = 65km/s/Mpc, ^Iq^dm = 0.3645, Aq = 0.6, f2o,b = 0.0355 (cf. Walker et al. 
1991), as — 0.79 (the simulations we use in this paper are the same as those used in CMOR 
and MCOR). The primary motivation for choosing this model is simple: overall it provides 
an excellent fit to available observations (Ostriker & Steinhardt 1995, "Concordance 
model"). The simulation box size is 10/i~^Mpc comoving with — 288^ cells and 144^ 
dark matter particles. The cell size is 35 h^^kpc (comoving) corresponding to an average 
baryonic cell mass of 6.3 x 10^ M©, with the true spatial and mass resolutions being 
about 2 and 8 times worse than those values, respectively. At ^ = 3, the Jeans length, 
Aj = {nc^/Gptoty^^ for Cg — Vrms — lOkm/s, is equal to 400/i~^kpc in comoving units, or 
11 cells. The power spectrum transfer function is computed using the method described in 
Cen, Gnedin, & Ostriker (1993). We use a new shock-capturing Total Variation Diminishing 
(TVD) cosmological hydrodynamic code described in Ryu et al. (1993). 

All the atomic processes for a plasma of (H, He) of primeval composition (76%, 24%) 
in mass are included, using the heating, cooling, and ionization terms described in Cen 
(1992). We calculate self-consistently the average background photoionizing radiation field 
as a function of frequency, assuming the radiation field is spatially uniform (i.e., optically 
thin). The evolution of the radiation field is calculated given the average attenuation in 
the simulated box and the emission (both from the gas itself and from the assumed sources 
of ionizing photons) . The time-dependent equations for the ionization structure of the gas 
are solved by iteration using an implicit method, to avoid the instabilities that arise in 
solving stiff equations. In general, the abundances of different species are close to ionization 
equilibrium between recombination and photoionization after most of the gas has been 
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photoionized. 

We model galaxy formation as in Cen & Ostriker (1992, 1993a,b). The material turning 
into coUisionless particles as "galaxies" is assumed to emit ionizing radiation with two 
types of spectra: one characteristic of star formation regions and the other characteristic 
of quasars, with efficiencies (i.e., the fraction of rest-mass energy converted into radiation) 
of euv,* = 5 X 10~^, and euv,Q = 6 x 10~^, respectively. We adopt the emission spectrum 
of massive stars from Scalo (1986) and that of quasars from Edelson and Malkan (1986). 
These coefficients have been chosen to produce a computed radiation field consistent with 
that obtained from the proximity effect and other observational constraints. Details of how 
we identify galaxy formation and follow the motions of formed galaxies have been described 
in Cen & Ostriker (1993a). Note that in this simulation supernova energy feedback from 
aging massive stars into the intergalactic medium is not included. 



3. Correlation Measures 

We analyze the clustering properties of Lya clouds along the line of sight at ^ = 3 
using six different two-point correlation functions: ^p^{r)i ^p^{t).i Cfiuxiv), Ct,c(i'), Ct,s{v), 
and Ciine{v). They are defined as follows. 

<p^{r + r')p^{r')> _ 

where ^p^(r) is the two-point correlation function (Peebles 1980) of the cold dark matter 
density {x = d) or baryonic density {x — h)-^ Pxij") is the density at point r along the line 
of sight; < > is its mean averaged over all lines of sight sampled. The angular brackets 
in the numerator indicate averaging over all pairs of pixels with the line of sight separation 
between r and r -\- Ar. Note that for ^p^{r) and ^p^{r) we ignore the peculiar velocity along 
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the line of sight so that the resulting correlations reflect the more fundamental matter 
correlations in real space. This fundamental correlational measure will be compared with 
other measures, which are directly observable and calculated in velocity space, including 
the pecuhar velocity as well as the thermal effect. 

The two-point correlation function of the normalized transmitted flux F{v) along the 
line of sight, ^fiuxiv), is defined by 

<F{v + v')F{v')> 

Kilux{v) = y2 ^ ' V2) 

where < F > is its mean averaged over all lines of sight sampled and the angular brackets 
in the numerator indicate averaging over all pairs of pixels with the line of sight separation 
between v and v + Av. Note that this flux correlation measure has a slightly different 
definition than another flux correlation measure used in MCOR (see equation 2 of MCOR 
for definition). 

Next, we propose a new two-point correlation function of (modified) optical depth 
along the hne of sight, defined as 

e„w. <--'7;>f''> -i , (3) 

< Tc > is its mean Tc averaged over all lines of sight sampled the angular brackets in 
the numerator indicate averaging over all pairs of pixels with the line of sight separation 
between v and v + Av. Tc{v) is not the optical depth as observed, but is the latter modified 
as: 



Tc{v) = Mm[Tmax,Tobsiv)] , (4) 

where Tobs{v) is what is directly measured observationally, related to the normalized 
transmitted flux by t{v) = —log[F{v)]; Min() is the minimum operator, which takes the 
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smaller of its two arguments. The letter "c" in the subscript of Tc and ^^-^c is intended to 
stand for "ceihng", imposed on Tabs (equation 4). Because of this notation, we may call ^r,c 
"capped optical depth correlation function" We use three values of Tmax = (2.0, 4.0, oo). 
Note that in the case Tmax — oo, Tc{v) — Tobs{v)- The purpose of choosing several r^ax's 
is to study the dependence of clustering strength on regions with different optical depth, 
different column densities. 

We propose another new two-point correlation function of (yet another modified) 
optical depth along the line of sight, defined as 



<rf{v + v')rf{v')> ^ 

where Tf{v) is defined as 



1, if Tobsiv) > Trrnn 

rf{v) = { (6) 

0, if Tobs{v) < Tmin 

We pick three values of r^m = (1-0, 2.0, 4.0), for a similar intention to study the dependence 
of clustering strength on the different regions but in a somewhat different angle than ^t- ,,. 
The letter "s" in the subscript of t/ and ^r,s is intended to stand for "step" , imposed on Tabs 
(equation 6). We will call (,r,s "stepped optical depth correlation function". 

Finally, we test the usual line- line two-point correlation function along the line of sight, 
defined as 

^ , , _ # of pairs with v —>■ v + Av ^ 

# of pairs for a random sample with v ^ v + Av ' 

We use the method described in MCOR to identify lines. This method is different from 
what is used in extracting lines from observed spectra, so a direct comparison between the 
simulation and observation is not possible at this time. 
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in all cases the global average <> is understood to be the average over our computed 
sample at a given redshift <>z- The purpose of examining several measures of correlation 
in the Lya absorption spectra is threefold. First, we wish to understand the relationship 
between correlations that are directly measured from observed absorption spectra and the 
fundamental correlation of the underlying matter distribution. For this purpose, we have 
included Cpd(^) ^^'^ Cpal^)- Second, we attempt to find a correlational measure which is 
relatively free from post-observation fitting procedures such as line profile fitting and line 
deblending, and is relatively insensitive to small changes in observational accuracy in terms 
of signal to noise ratio and FWHM. For this reason, we have calculated ifiux{v), ^T,ci^) and 
iT,s{v). Each of these three measures is directly based on the observed flux distributions and 
does not require any fitting procedure. Furthermore, we expect that, once the resolution of 
observations becomes higher than the width caused by the expected thermal broadening, 
these measures will be rather robust. This property is in sharp contrast with that of line 
correlations, for which ambiguity exists in deblending lines regardless of how high a signal 
to noise ratio an observation can achieve. Third, we want to understand the implications 
of available observational results obtained by employing the line-line two-point correlation 
function. Therefore, we also examine the simulation using line-line two-point correlation 
function ^uneiv), although a direct comparison with observations is still not possible 
due to differences between the line identification algorithms used in observations and in 
simulations. 

For the first two correlations, and ^p^, no observational noise is added and their 
validity in a specific cosmological model only depends on the accuracy of the simulation, 
which is primarily limited by the simulation resolution and box size (and of course 
additionally by the residual uncertainty in the underlying cosmological model). For the 
remaining four correlational measures we generate two sets of spectra with (S/N=10, 
FWHM=30km/s) and (S/N=50, FWHM=6km/s). A sampling bin of 2km s"^ is used in aU 
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cases. Since we are interested in regions with large amount of absorption (" clouds" ) as well 
as those with mild amount of absorption (weak lines or "voids"), the widely used Gaussian 
statistic of noise is not valid, in general. We use the Poisson statistic of noise, which is valid 
in both low and high flux regions assuming that we know the signal to noise level at each 
pixel of a spectrum and is identical to the Gaussian statistic in the limit of large photon 
numbers. This is done in the following manner. 

First, we require that the average flux decrement in the simulated spectra. 



{D) = 1 — (exp ^Lya^^ equal to the observed value, {D)obs = 0.36 at z = 3 ([Press, Rybicki 



|fc Schneider 199^ ). This is achieved by adjusting the parameter, /i = — ffio.b/o.oisfe — )_ 



(h jjj/io-i2sec-i)V2 

(MCOR), where h is Hubble constant in units of lOOkm/s/Mpc, fio.b is the mean baryonic 
density and Jh is the hydrogen photoionization rate. This normalization procedure for 
the flux distribution is necessary in order to fix its overall amplitude, due to the large 
uncertainties of the observed values of fio,fe and jn- The fitted value of /i is 1.90 for the 
ACDM model. Second, we add noise to the normalized spectrum. By definition, the signal 
to noise ratio at the continuum is 



S/N 



y/ -^src ~l~ -^noise 

where Ngrc and Nnoise are the number of source photons at the continuum and the number 
of noise photons, respectively, per frequency bin. Thus, given S/N and Nnoise, we can solve 
equation (8) to get Ngrc- To simplify the illustration (without loss of generality) we assume 
that Nnoise is dominated by the detector read noise. This is a good approximation only 
for bright quasars where the number of sky photons is small (due to a shorter exposure 
time) compared to the read noise of, say, a CCD detector. For example, the gain of the 
HIRES CCD detector on the Keck telescope is 6.1 electrons, so the number of photons 
due to the CCD read noise integrated over 5 spatial pixels (for each frequency bin) is 
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NccD = 5 X 6.1^ = 186. For SkV — 16.5 mag quasar at 5000A with 1-hr integration time, 
there are about 4 photons from the sky per spatial pixel, giving a total count of sky photons 
per frequency pixel (integrated over the 5 spatial pixels) of only 20 photons (see, e.g., Hu 
et al. 1995). A frequency bin in the synthetic noise- free spectrum with flux / contains 
fNsrc photons. When noise is added, the "observed" number of photons (subtracted by the 
known CCD read noise) in the frequency bin is 



where Poisson{X) means a Poisson distributed random number with the mean equal to X. 
The NccD value of the HIRES CCD detector is adopted in the calculations. 



For each correlation function computed we use 12,000 lines of sight each having a 
length of 1280km/s. Figure 1 shows ^p^{r) (sohd curve) and ^^^(r) (dashed curve). Note 
that the two correlations are computed in real space as a function of proper distance, as 
shown in the bottom horizontal axis. For reference the velocity distance, converted from the 
proper distance using the relation v — r H{z), where H{z) — 512km/s/Mpc is the Hubble 
constant at 2; = 3 for the adopted model, is shown in the top horizontal axis. Note that 1 + ^ 
rather than ^ is shown in the plot in order to display negative values of ^ on a logarithmic 
scale. We see that both dark matter and gas are highly clustered at scales < 0.1/i~^Mpc 
comoving, with dark matter being more strongly clustered than gas on all scales. This 
apparently surprising result that baryonic gas density is anti biased with respect to dark 
matter density has been found in all simulations to date begining with our earliest numerical 
work (Cen 1992). While no completely satisfactory explanation has been given for this fact 



Nohs = Poisson{fNsrc + Nccd) - Nccd, 



(9) 



4. 



Results 
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it presumably reflects the greater instability of the dark matter to waves perpendicular to 
the compression vector. The correlation lengths of the dark matter and gas, defined as the 
length at which ^ is unity, are 0.63/i~^Mpc and 0.46/i~^Mpc comoving, respectively. The 
difference in the clustering between dark matter and gas is enhanced due to the thermal 
photoheating effect in the baryons. It is noted that the resolution of the simulation is 
0.087/i^^Mpc (2.5 cells), so numerical smoothing in the gas can not be responsible for the 
difference seen on scales > 0.1/i~^Mpc. A significant part of the difference may be due to 
the small fraction of baryons (~ 6% hy z — 3) which were in the highest peaks and have 
condensed out of the IGM to form stars irreversibly in the simulation. 

Figure 2 shows the fiux correlation function ^fiux{v) [defined by equation (2)] for 
S/N=50, FWHM=6km/s (sohd curve) and S/N=10, FWHM=30km/s (dashed curve). We 
see that the flux correlation depends on the observational resolution at small separations, 
but does not depend on the observational resolution at large separations. For example, 
for the two cases with the adopted observational parameters, the correlation strengths at 
V = (10, 100)km/s are = (0.20,0.045) and (0.16,0.042), respectively. 

Figure 3 shows the "capped optical depth correlation function" ^t,c{v) [deflned by 
equation (3)] for two sets of observational parameters, S/N=50, FWHM=6km/s (thick 
curves) and S/N=10, FWHM=30km/s (thin curves). For each case three functions for 
three optical depth ceiling values are shown: T^ax — (solid curve), Tmax — 4.0 (dotted 
curve) and Tmax = 2.0 (dashed curve). Three features are to be noted. First, similar 
to the flux correlation function {^fiux, shown in Figure 2), this optical depth correlation 
function only weakly depends on S/N and FWHM on scales > 40km/s. Second, on scales 
< 40km/s its values start to depend more strongly on S/N and FWHM, presumably due to 
the dominance of small, dense regions at small scales, which become under-resolved with 
lower observational resolution and/or sensitivity. Third, the correlation depends strongly 
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on the imposed ceiling value; a lower ceiling value results in a weaker correlation. In other 
words, high optical depth, high density regions are more strongly clustered than low optical 
depth, low density regions on all scales probed here, 1 — 320km/s, with the trend that the 
difference is progressively larger on smaller scales. Note that one side of our simulation box 
spans a velocity range of 1280km/s, so the reader is reminded that the correlation computed 
here is only valid for scales < 320km/s (a quarter of the box size), because of the periodic 
boundary condition. It is noted that the two-point correlation function measured by any 
of the proposed functions here is underestimated due to the missing density waves larger 
than our simulation box (A > 10^~^Mpc). Since ^(r) oc Pkk'^sin(kr)d\nk and Pk ~ 
at A = 10/i~^Mpc (and the fiducial power index becomes progressively larger than -2.0 on 
larger scales) for the adopted model, we expect that the underestimate is of a logarithmic 
factor, perhaps a factor less than two for the computed correlations on scales v < 300 km/s. 

To see separately the effects of thermal broadening and peculiar velocity we show, in 
Figure 4, the corresponding correlations with only the thermal broadening effect (dotted 
curves), with only the peculiar velocity effect (dashed curves) and with both effects (solid 
curves; the same as the thick curves in Figure 3) for the case with S/N=50, FWHM=6km/s. 
For each case three functions for three optical depth ceiling values are shown: Tmax — oo 
(thick curve), Tmax = 4.0 (medium curve) and Tmax = 2.0 (thin curve). We see that the 
peculiar velocity of the gas serves to boost correlations on both large {v > lOOkm/s) and 
small scales {v < lOkm/s), but to suppress correlations on intermediate scales. The boost 
of correlation on small scales by the peculiar velocity produces velocity caustics and the 
boost on large scales due to peculiar velocity is essentially the "Kaiser Effect" (Kaiser 
1987). Since the total number of pairs has to be conserved, the correlation strength on 
intermediate scales is reduced. The thermal motion alone tends to reduce correlations on 
all scales. However, coupling of peculiar velocity and thermal motion is not simple. It 
appears that the correlation is slightly stronger at the intermediate scales v = 15 — 60km/s 
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when both effects are included than that with either of the effects alone. This result reflects 
the complicated interplay between thermal motion and peculiar velocity of the gas on the 
correlation. The result that the correlation function at small scales is reduced when both 
the thermal and velocity effects are included indicates that velocity caustics are largely 
removed by the thermal broadening effect. But overall, the two effects do not seem to 
significantly alter the true correlation of the clouds on scales > 20km/s. 

Figure 5 shows the "stepped optical depth correlation function" ^r,s('f) [defined by 
equation (6)] for S/N=50, FWHM=6km/s (thick curves) and S/N=10, FWHM=30km/s 
(thin curves). For each case three functions for three optical depth floor values are shown: 
'Tmin = 1-0 (solid curvc), Tmin — 2.0 (dottcd curve) and Tmin — 4.0 (dashed curve). We 
remind the reader that a higher value of selection of only higher optical 

depth regions. We see that, at r^m = 1-0 (solid curves), observational parameters S/N 
and FWHM do not affect the results on the scale > 30km/s, simply because the dominant 
contributions from regions of optical depths just above the floor value are resolution 
insensitive. Equivalently stated, these regions are already well resolved even by the quoted 
lower resolution observations. The situation is somewhat different for higher rain's, where 
higher S/N and lower FWHM give shghtly weaker correlations. This is because lower S/N 
and higher FWHM observations only pick out relatively higher density, high optical depth 
regions, which are intrinsically more strongly clustered. On small scales, the observational 
resolution dependence is seen for all three r^m's, where higher S/N and lower FWHM 
give stronger correlations as expected, which is, of course, directly related to the greater 
resolving power of the relevant dense regions in high resolution observations. Consistent 
with results shown in Figure 3, we find here that high optical depth, high density regions 
are more strongly clustered than lower optical depth, low density regions on all scales 
probed here, 1 — 320km/s. 
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Similar to Figure 4, Figure 6 shows separately the correlations ^r,s with only thermal 
broadening (dotted curves), with only the peculiar velocity effect (dashed curves) and 
with both effects (solid curves; the same as the thick curves in Figure 5). For each case 
three functions for three optical depth ceiling values are shown: Tmin — 4.0 (thick curve), 
Tmin — 2.0 (medium curve) and Tmin — 1-0 (thin curve). The reader is reminded that the 
main difference between ^ shown in Figures (3,4) and ^t-,s shown in Figures (5,6) is that 
^r,c measures the correlation by excluding high density, high optical depth regions, while 
^r,s measures the correlation by excluding low density, low optical depth regions. We see 
that the correlation with peculiar velocity alone has the highest amplitude, the one with 
thermal motion alone has the lowest amplitude, while the correlation with both effects is 
intermediate at all scales except v > lOOkm/s. This result is consistent with the fact that 
high density regions are more strongly clustered and peculiar velocities tend to enhance 
the correlation strength by, on average, decreasing the pairwise distance in velocity space 
(Kaiser 1987). 

Finally, we show in Figure 7 the line-line correlation function C,iine{v) [defined by 
equation (7)] for S/N=50, FWHM=6km/s (thick curves) and S/N=10, FWHM=30km/s 
(thin curves), with a flux threshold Ft — 0.6 (see MCOR). For each case three functions 
for two equivalent widths are shown: Wt = lOkm/s (sohd curve), and Wt = 40km/s 
(dashed curve). We stress that the correlations on scales < 100 — 200km/s are probably 
sensitive to the details of the line identification scheme. In particular, we believe that 
the sharp downturns of the fine correlation functions on scales < 100 — 200km/s are, in 
large part, likely due to the blending of lines adjacent in velocity space. However, the 
correlations on scales > 200km/s are perhaps more robust and less sensitive to specifics of 
a fine identification method. We therefore focus on scales 200 — 320km/s, where the upper 
limit, corresponding to a quarter of the box size, arises from the fact that our simulation 
box assumes periodic boundary conditions. We see that a higher Wt results in a stronger 
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correlation on scales > 200km/s. This trend is consistent with observations that there is 
a gradual progression in clustering strength from weak to strong with increasing column 
density (Crotts 1989; Cristiani et al. 1997). The results are shown to be insensitive to 
S/N and FWHM. To examine the dependence of the line-line correlation function on the 
flux threshold Ft, Figure 8 shows Cuneiv) at Ft — 0.4 (solid curve), 0.6 (dotted curve) and 

0. 8 (dashed curve), each with S/N=50, FWHM=6km/s and Wt = lOkm/s. We see that 
demanding a lower Ft is qualitatively equivalent to demanding a larger equivalent width 
Wt, as shown in Figure 7. 

The above separately displayed results are best presented as summarized in Table 

1, where we show the clustering strength of all the correlation measures at separations 
of 50,100,200 and 300km/s (at z = 3). All measures use S/N=50, FWHM=6km/s. 
For ^line, Wt = lOkm/s is used but we note that results with Wt = 40km/s are very 
similar. In addition, two measures for galaxy clustering are shown: ''Xgaiaxy' indicates the 
galaxy-galaxy correlation strength at z = 3 if we assume that the galaxies are (linearly) 
biased over dark matter by an amount equal to that at 2; = 0, i.e., b = a^^ = 1.27, where 
as is the density fluctuation in a top-hat sphere of radius 8/i~^Mpc at ^ = 0; ^stable is 
obtained by extrapolating the observed z — galaxy-galaxy two-point correlation function 
['C(O) = {r /5.5h~^Mpc)~^-^] to 2; = 3 assuming stable clustering (i.e. correlation function is 
fixed in real space). A few points become clear from the table. 

First, the cosmological model adopted is interesting in that "Cgaiaxy' is in agreement 
with ^stable at 2; = 3. However, if galaxies at 2; = 3 turn out to be much more strongly 
clustered than that indicated by ^galaxy, then a higher bias of galaxies over matter is required 
(if one adopts this particular cosmological model) and it implies that the stable clustering 
assumption is invalid, vice versa. Note that this model also reproduces all the observations 
at ^ = quantitatively well. Second, the flux correlation ^fiux does not represent the true 
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matter correlation well, partly because of its definition and partly because of the fact that 
it does not particularly reflect the correlation of dense regions (i.e. "clouds"). Third, the 
optical depth correlation directly using the observed optical depth [Cr,c(oo)] traces the 
true correlation of the underlying matter well with some moderate anti-bias at scales 
< lOOkm/s. Removing high optical depth regions significantly reduces the correlation on all 
scales; Cr,c(2) is about a factor of two smaller than ^^c(oo) at all scales. Fourth, the best 
representation of the matter correlation is achieved by ^r,s(2), which faithfully traces on 
all the scales shown. Since optical depth r — 2 should be easily identifiable with the current 
observational sensitivity without much distortion caused by noise, Cr,s(2) may represent the 
best correlational measure both to compare observations with simulations and to compare 
correlations of Lyct clouds with correlations of galaxies. We note that a Lyct cloud with 
a column density oi N — 1 x lO^^cm"^ and a Doppler width b — 25km s~^ has a central 
Lya optical depth of ~ 3.0. Therefore, C,t.s{2) measures the spatial correlation of Lya 
clouds with column densities > 10^^~^^/cm^. A Tmin less than 2.0 enables us to compute 
correlations of lower column density clouds. The essential difference between the optical 
depth based correlation measures and fiux based measures is that optical depth directly 
probes the densities of the clouds, thus a direct measure of density. Finally, we emphasize 
that the line-line correlation is highly dependent on the details of the line identification 
scheme. While a comparison between observations and simulations can always be made as 
long as both are analyzed in the same way using a chosen measure, the difference between 
the line-line correlational measure, which is almost the sole, used clustering indicator of 
observed Lyman forest correlations, and ^,-,5 (2), which represents an alternative to the 
line-line correlation measure, seems clear. The number of lines identified for a given "true" 
cloud (which is easy to see in simulations) or a given domain of observed spectra depends 
on observational resolution and noise as well as post-observation fitting procedures. Even in 
the limit of infinite observational sensitivity the number of lines determined is not unique. 
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due to the ambiguity in the hne fitting and deblending process. We note that ^t,s picks out 
high optical depth regions, gives each (uniform) spectral pixel in the selected region equal 
weight, and measures the correlation of these pixels. In essence, ^r,s is a line-line correlation, 
if one visualizes each selected spectral pixel as a "line" , but it lacks any ambiguity. 

While a direct comparison with available observations is not attempted due to 
the lack of a common measure, we see a broad consistency. Sargent et al. (1980) find 
that there is very weak correlation on scales 300 — 30, OOOkm/s with an observational 
FWHM~ 50km/s at ^ = 1.7 - 3.3. Webb (1986) finds a correlation strength of 0.32 ± 0.08 
for = 50 - 290km/s with FWHM~ 20km/s at ^ = 1.9 - 2.8. Meiksin & Bouchet 
(1995) find positive correlations, with a maximum amplitude of 0.5 — 1.0, on the scales 
Av = 100 - 600km/s with FWHM~ 25km/s at z = 2 - 4. Ranch et al. (1992) show that 
there is no strong clustering on all scales probed, Av > lOOkm/s with FWHM~ 20km/s 
Bit z — 2.7 — 3.4. Cristiani et al. (1997) conclude that there is significant clustering at 
Av < 300km/s with FWHM~ lOkm/s at z ~ 3.0, with ^ ~ 0.6 at Av = lOOkm/s for clouds 
with Nhi > lO^^'^cm"^, and no significant clustering for Nhi < lO^^'^cm"^. 



5. Discussion and Conclusions 

Examination of several measures of the correlation of Lya clouds and comparison 
between them and fundamental matter correlations shows that mass is more strongly 
clustered than that of Lya clouds, on all scales sit z — 3. It is also found that, among the 
Lya clouds, higher density, higher optical depth, higher column density regions are more 
strongly clustered than lower density, lower optical depth, lower column density regions, 
with the difference being larger at small separations and smaller at large separations. These 
effects should be observable even in the current observations. Although we are not able 
to make a direct comparison with available observations due to the lack of a common 
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measure, we see a broad consistency between our simulation results and observations. If we 
assume that galaxies form preferentially in regions whose densities are much higher than 
those of the Lya clouds, especially at the redshift considered here, then a consistent picture 
emerges: the correlation strength for a given set of objects is positively correlated with 
their characteristic global density and the differences among the correlations of galaxies, 
Lya clouds and mass reflect the differences in density that each trace. The property might 
be tracable to the nature of Gaussian density field where higher density peaks are more 
strongly clustered than lower density peaks (Bardeen et al. 1986). 

The true correlation strength of the matter is about 0.2 — 2.0 on scales 50 — 300km/s, 
where it can be measured reliably in our simulations, ai z — 3. Significant positive 
correlations with a strength of 0.1 — 1.0 are predicted for Lya clouds in the velocity 
range 50 — 300km/s. All correlation functions examined here decrease monotonically with 
increasing separation. Due to our limited box size (10/i~^Mpc) the computed correlation 
underestimates the true correlation. However, the underestimate is expected to be a 
factor less than two at f ~ 300 km/s and smaller at smaller scales. While our limited 
simulation box size hampers our ability to study the correlations of the Lya forest on scales 
> 2.5/i~^Mpc {Av — 320km/s), we expect that the correlation strength of Lya clouds on 
scales > 300km/s is small, < 0.1. The correlation strength on scales of several hundred 
km/sec measures the power of the density fluctuations on comoving megaparsec scales. 
Thus, an accurate determination of the correlation strength on scales of several hundred 
km/sec provides a unique data point for the power spectrum on comoving megaparsec 
scales. 

Among the measures examined, the "stepped optical depth correlation function" 
(Equation 5) seems to be the most useful correlational measure. It reasonably represents the 
true correlation of the underlying matter, thus enabling a better indication of both matter 
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correlation and the relationship between galaxies and Lya clouds. Moreover, it appears to 
provide a simple alternative to the conventional line-line correlation function with a few 
obviously advantageous features: 1) it does not require ambiguous post-observation line 
fitting and deblending procedures such as those commonly employed in the conventional 
line finding methods, 2) it does not depend sensitively on the observational resolution such 
as FWHM, once a "true" cloud (which is hard to define observationally but easy to identify 
and study in three-dimensional simulations) is resolved (this typically means that an 
FWHM is sufficiently smaller than the thermal broadening width of a cloud) and, 3) it can 
be easily measured with the current observational sensitivity without being contaminated 
significantly by the presence of noise, if one chooses an appropriate optical depth floor value 
Tmm, say, < 2.0. 

Applying the newly proposed correlational measures, particularly, iT,s{v) [defined by 
equation (5)], to observed spectra, both along the line of sight and in the perpendicular 
direction for nearby sightlines, will perhaps yield much more unambiguous results about the 
spatial clustering of the observed Lya clouds. Comparison of correlations between observed 
and model spectra employing such a simple unambiguous measure might enable a much 
simpler interpretation of some essential issues such as if the clouds are made of smaller 
dense cloudlets. 

Aside from the choice of a particular cosmological model which seems viable, the 
critical ingredient in our simulation is the assumption that the ionizing radiation field 
is spatially uniform. While it seems reasonable, future larger simulations with box size 
> 20/i~^Mpc are required in order to study correlations up to the scale 5/i~^Mpc. Such a 
study will shed hght on the origin of the correlations of Lya forest and possibly also on the 
properties of the meta-galactic UV background, when compared with observations. 
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Fig. 1. — shows ^pj(r) (solid curve) and ^p^(r) (dashed curve) . Note that the two correlations 
are computed in real space as a function of proper distance, as shown in the bottom horizontal 
axis. For reference velocity distance, converted from the proper distance using the relation 
V — r/H{z), where H{z) — 512km/s/Mpc is the Hubble constant at z = 3 for the adopted 
model, is shown in the top horizontal axis. Note that 1 + ^ rather than ^ is shown in the 
plot in order to display negative values of ^ in the logarithmic scale. 

Fig. 2. — shows the flux correlation function ^jiuxiv) [defined by equation (2)] for two 
sets of observational parameters, S/N=50, FWHM=6km/s (solid curve) and S/N=10, 
FWHM=30km/s (dashed curve). 

Fig. 3. — shows the "capped optical depth correlation function" ^T,c(f^) [defined by equation 
(3)] for two sets of observational parameters S/N=50, FWHM=6km/s (thick curves) and 
S/N=10, FWHM=30km/s (thin curves). For each case three functions for three optical 
depth ceiling values are shown: Tj^ax = oo (solid curve), r^ax = 4.0 (dotted curve) and 
Tmax = 2.0 (dashed curve). 

Fig. 4. — shows the "capped optical depth correlation function" Ct,c{'v) [defined by equation 
(3)] with only thermal broadening (dotted curves), only the pecuhar velocity effect (dashed 
curves) and both effects (solid curves; the same as the thick curves in Figure 3) for the case 
with S/N=50, FWHM=6km/s. For each case three functions for three optical depth ceiling 
values are shown: Tmax — oo (thick curve), Tmax — 4.0 (medium curve) and Tmax — 2.0 (thin 
curve) . 

Fig. 5. — shows the "stepped optical depth correlation function" ^t,s{v) [defined by equation 
(5)] for two sets of observational parameters S/N=50, FWHM=6km/s (thick curves) and 
S/N=10, FWHM=30km/s (thin curves). For each case three functions for three optical 
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depth floor values are shown: T^j„ = 1.0 (solid curve), T^i„ = 2.0 (dotted curve) and 
Tmin — 4.0 (dashed curve). 

Fig. 6. — shows the "stepped optical depth correlation function" CtA"^) [defined by equation 
(5)] with only thermal broadening (dotted curves), only the peculiar velocity effect (dashed 
curves) and both effects (solid curves; the same as the thick curves in Figure 5). For each 
case three functions for three optical depth ceiling values are shown: Tmin — 4.0 (thick curve), 
Tmin = 2.0 (medium curve) and Tmin = 1-0 (thin curve). 

Fig. 7. — shows the line-line correlation function Ciineiv) [defined by equation (7)] for 
two sets of observational parameters S/N=50, FWHM=6km/s (thick curves) and S/N=10, 
FWHM=30km/s (thin curves), with a ffux threshold Ft — 0.6 (see MCOR). For each case 
three functions for two equivalent widths are shown: Wt — lOkm/s (solid curve), and 
Wt — 40km/s (dashed curve). 

Fig. 8. — shows ^iine{v) at Ft — 0.4 (sohd curve), 0.6 (dotted curve) and 0.8 (dashed curve), 
each with S/N=50, FWHM=6km/s and Wt = lOkm/s. 
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Table 1. Summary of correlations 
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'^comoving length units 

''probably underestimated due to line blending 

'^galaxy correlation at z — 3 derived from assuming a linear density bias of 1.27 — l/as 

'^galaxy correlation at z — 3 derived by extrapolating present observed galaxy correlation function to 

z = 3 assuming stable clustering 
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